Scale calibration for high-dimensional robust regression
نویسندگان
چکیده
We present a new method for robust high-dimensional linear regression when the scale parameter of additive errors is unknown. The proposed estimator based on penalized Huber M-estimator, which theoretical bounds estimation error have recently been in statistics literature. However, variance term model intricately connected to optimal used define shape loss. Our main idea use an adaptive technique, Lepski’s method, overcome difficulties solving joint nonconvex optimization problem with respect location and parameters. Furthermore, by including weight definition our consistency results hold even covariates are heavy-tailed. then derive asymptotic normality one-step constructed from estimator, can be construct confidence regions subsets coordinates. shown semiparametrically efficient sub-exponential. substantially generalize previous work inference, derived under sub-Gaussian assumptions both covariate distributions.
منابع مشابه
Robust High-Dimensional Linear Regression
The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the most important factors in predicting outcomes. However, the economic importance of learning has made it a natural target for adversarial manipulation of trainin...
متن کاملRobust Ridge Regression for High-Dimensional Data
Ridge regression, being based on the minimization of a quadratic loss function, is sensitive to outliers. Current proposals for robust ridge regression estimators are sensitive to bad leverage observations, cannot be employed when the number of predictors p is larger than the number of observations n; and have a low robustness when the ratio p=n is large. In this paper a ridge regression esti...
متن کاملOn robust regression with high-dimensional predictors.
We study regression M-estimates in the setting where p, the number of covariates, and n, the number of observations, are both large, but p ≤ n. We find an exact stochastic representation for the distribution of β = argmin(β∈ℝ(p)) Σ(i=1)(n) ρ(Y(i) - X(i')β) at fixed p and n under various assumptions on the objective function ρ and our statistical model. A scalar random variable whose determinist...
متن کاملMethods for regression analysis in high-dimensional data
By evolving science, knowledge and technology, new and precise methods for measuring, collecting and recording information have been innovated, which have resulted in the appearance and development of high-dimensional data. The high-dimensional data set, i.e., a data set in which the number of explanatory variables is much larger than the number of observations, cannot be easily analyzed by ...
متن کاملRobust high-dimensional semiparametric regression using optimized differencing method applied to the vitamin B2 production data
Background and purpose: By evolving science, knowledge, and technology, we deal with high-dimensional data in which the number of predictors may considerably exceed the sample size. The main problems with high-dimensional data are the estimation of the coefficients and interpretation. For high-dimension problems, classical methods are not reliable because of a large number of predictor variable...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronic Journal of Statistics
سال: 2021
ISSN: ['1935-7524']
DOI: https://doi.org/10.1214/21-ejs1936